Vehicle mounted controller for air conditioner, analyses state of input voice command, if input 
voice command is unrecognizable by voice recognition unit, and accordingly changes color of 
icon currently displayed on display device 
Alerting Abstract ...NOVELTY - A control unit (2) analyses the state of the input voice command 
if the input voice command is unrecognizable by a voice recognition unit (5). The control unit 
changes the color of the icon currently displayed on the screen of an externally connected 
display unit (6), based on ... ADVANTAGE - Improves voice recognition rate and increases control 
operation efficiency ... 5 voice recognition unit... 
Class Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
...G10L-0015/00 G10L-0015/22 G10L-0015/28 G10L-0021/00 . G10L-0015/00... ...G10L-0021/00 
Original Publication Data by Authority 
Original Abstracts: A vehicle mounted 
control apparatus includes a voice recognition section (5) for recognizing a voice command that a 
user inputs by means of a microphone (8) connected to outside; and a control section (2) that analyzes 
a cause of incapability of recognition of the voice command when the voice command cannot be 
recognized by the voice recognition section (5) and changes the display format of a screen of a 
display device (6) connected to outside on the basis of results of the analysis. 

A vehicle mounted control apparatus includes a voice recognition section for recognizing a voice 
command input by a voice input device; and a control section that analyzes a cause of incapability 
of recognition of the voice command when the voice command cannot be recognized by the voice 
recognition section and gives a notice on the result of the analysis. 

A vehicle mounted controller comprising a voice recognition means for recognizing the voice 
instruction of a user inputted from an externally connected voice input means, and a control 
section for analyzing the cause when a voice is unrecognizable by the voice recognition means and 
altering the display format of the screen of an externally connected display means based on the 
results of analysis. 

...voice recognition capable of recognizing an instruction given orally by a user from an 
externally connected voice input device, and a control part able to analyze the cause of the 
non-recognition of a voice by the voice recognition device and to modify the screen format of an 
externally connected display device in light of the results of the analysis. 

Claims: A vehicle mounted control apparatus comprising: 
a voice recognition section (5) for recognizing a voice command that a user inputs by means of a 
voice input device (8) connected to outside; and 
a control section (2) that analyzes a cause of incapability of recognition of the voice 
command when the voice command cannot be recognized by the voice recognition section (5) 
and changes a display format of a screen of a display device (6) connected to outside on the basis of a 
result of the analysis... 
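
The control section described in this record can be illustrated with a minimal sketch. The record does not say which failure causes are distinguished or which colors are used, so the cause names, thresholds, color scheme and the `Utterance` fields below are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class FailureCause(Enum):
    # Hypothetical causes; the record does not enumerate them.
    TOO_NOISY = "too_noisy"
    TOO_QUIET = "too_quiet"
    TOO_LOUD = "too_loud"
    UNKNOWN = "unknown"


# Hypothetical color scheme: the record only states that the icon color changes.
ICON_COLORS = {
    FailureCause.TOO_NOISY: "red",
    FailureCause.TOO_QUIET: "blue",
    FailureCause.TOO_LOUD: "orange",
    FailureCause.UNKNOWN: "grey",
}


@dataclass
class Utterance:
    peak_level: float   # normalised peak amplitude, 0.0 to 1.0 (assumed measure)
    noise_level: float  # normalised background noise estimate, 0.0 to 1.0 (assumed measure)


def analyze_failure(u: Utterance, quiet=0.1, loud=0.95, noisy=0.5) -> FailureCause:
    """Guess why recognition failed, using assumed thresholds."""
    if u.noise_level > noisy:
        return FailureCause.TOO_NOISY
    if u.peak_level < quiet:
        return FailureCause.TOO_QUIET
    if u.peak_level > loud:
        return FailureCause.TOO_LOUD
    return FailureCause.UNKNOWN


def icon_color(recognized: bool, u: Utterance) -> str:
    """Return the icon color to show after a recognition attempt."""
    if recognized:
        return "green"
    return ICON_COLORS[analyze_failure(u)]
```

For example, `icon_color(False, Utterance(peak_level=0.05, noise_level=0.1))` selects the color assigned to the "too quiet" cause, which tells the user how to repeat the command.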
Electronic device e.g. instruction code generation apparatus displays text data in 
synchronization with output of audio data, in synchronous display unit 
Original Titles: ELECTRONIC EQUIPMENT, INSTRUCTION CODE GENERATING DEVICE AND 
ELECTRONIC EQUIPMENT CONTROL PROGRAM, INSTRUCTION CODE GENERATION 
PROCESSING PROGRAM 
Alerting Abstract ...NOVELTY - A detector detects the pointer specified position of the text data 
displayed in a display unit (18). A matching unit matches the specified position of text data with 
audio data output by an audio output unit. A synchronous display unit (G) displays the text data in 
synchronization with output of audio data. ... instruction code generation apparatus; instruction 
code generation processing program; and electronic device control program ... USE - Electronic 
device e.g. instruction code generation apparatus (claimed) for synchronizing output of text data 
with audio data in karaoke apparatus ... ADVANTAGE - The correlation information for performing 
synchronous regeneration of audio and text files is generated easily ... DESCRIPTION OF 
DRAWINGS - The figure shows the text-audio synchronous display in the electronic device ... 
18 display unit; G synchronous display screen 
Title Terms .../Index Terms/Additional Words: DISPLAY; 
Class Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
...G10L-0013/00 G10L-0021/06 G10L-0013/00 G10L-0021/00 
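
The correlation between text positions and audio output mentioned in this record can be sketched as a cue table. This is only an illustration: the record does not give the data format, so the `CUES` list, its contents and both function names are hypothetical.

```python
from bisect import bisect_right

# Hypothetical correlation information: (audio start time in seconds, text segment).
CUES = [
    (0.0, "Twinkle, twinkle,"),
    (2.5, "little star,"),
    (5.0, "how I wonder"),
    (7.5, "what you are."),
]


def segment_at(cues, t):
    """Index of the text segment to highlight at playback time t."""
    starts = [start for start, _ in cues]
    # bisect_right finds the first cue starting after t; step back one.
    return max(bisect_right(starts, t) - 1, 0)


def seek_time_for_char(cues, char_pos):
    """Audio start time for a pointer-specified character position in the text."""
    offset = 0
    for start, text in cues:
        if char_pos < offset + len(text):
            return start
        offset += len(text)
    raise IndexError("character position is past the end of the text")
```

`segment_at` covers the display direction (audio time to text), and `seek_time_for_char` covers the matching direction (pointer position in the text to audio time).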
Voice control system for strobe light, has output unit to extract recognition word 
corresponding to code, and generate message utilizing extracted recognition code and message 
is notified to user through display 
Alerting Abstract ...system (10) has a function execution unit to 
execute a function corresponding to a command code from the unit. A message output unit extracts a 
recognition word corresponding to a word code from a storing unit and generates a message that 
utilizes the extracted recognition code. The generated message is notified to a user through a 
display (4) or a speaker (5). ...ADVANTAGE - The system enables the uttered speech by the user to 
be properly included in the notified message, such that the user is prevented from misunderstanding 
that the uttered speech might be misrecognized ... 4 Display; 8 Yaw rate sensor; (10) Voice 
recognition system 
Title Terms .../Index Terms/Additional Words: DISPLAY 
Class Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
G10L-015/00... Main ...G10L-015/22 G10L-015/28 
Original Publication Data by Authority 
Original Abstracts: A voice recognition unit recognizes a recognition word code corresponding to a speech 
uttered by a user along with a command code for executing a function of a control object. A function 
execution unit executes a function corresponding to the command code from the voice recognition 
unit. A message outputting unit extracts a recognition word corresponding to the recognition word 
code from a recognition word storing unit, and generates a message properly utilizing the extracted 
recognition word. The generated message is notified to the user via a display or a speaker, so that 
the user can confirm that the uttered speech is adequately recognized. 

Claims: What is claimed is: 1. A voice control system comprising: 
voice inputting means for inputting a speech uttered by a user; 
recognizing means for recognizing the uttered speech as a speech content; 
command code storing means for storing, as a pair, the speech content and a command code used for 
executing a function of a control object, wherein the command code is correlated with the speech 
content and has one-to-one or one-to-many correspondence with the speech content; 
extracting means for extracting, from the command code storing means, the command code based on 
the speech content as which the uttered speech is recognized when the speech is inputted; 
controlling means for executing the function based on the ... message generating means for 
generating a message corresponding to the executed function when the function is executed through 
the command code; and 
notifying means for notifying the generated message, 
wherein the message generating means generates the message by utilizing the recognized speech 
content in a case that the command code has one-to-many correspondence with the speech content. 

What is claimed is: 1. A voice control system comprising: 
voice inputting means for inputting a speech uttered by a user; 
recognizing means for recognizing the uttered speech as a speech content; 
command code storing means for storing, as a pair, the speech content and a command code used for 
executing a function of a control object, wherein the command code is correlated with different 
speech contents for the uttered speech and the command code has one-to-one or one-to-many 
correspondence with the speech content; 
extracting means for extracting, from the command code storing means, the command code based on 
the speech content as which the uttered speech is recognized when the speech is inputted; 
controlling means for executing the function based on the command code extracted by the 
extracting means; 
message generating means for generating a message corresponding to the executed function when 
the function is executed through the command code; and 
notifying means for notifying the generated message, 
wherein the message generating means generates the message by utilizing the recognized speech 
content in a case that the command code correlates to different speech contents. 
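
The claimed rule, that the message includes the recognized speech content when several speech contents map to the same command code, can be sketched as follows. The phrases, command codes and message texts are hypothetical examples, not taken from the patent.

```python
from typing import Dict, Optional, Tuple

# Hypothetical command code store: speech content -> command code.
# "TEMP_UP" has a one-to-many correspondence; "TEMP_DOWN" is one-to-one.
COMMAND_CODES: Dict[str, str] = {
    "warmer": "TEMP_UP",
    "raise the temperature": "TEMP_UP",
    "cooler": "TEMP_DOWN",
}

# Hypothetical message text for each executed function.
FUNCTION_MESSAGES: Dict[str, str] = {
    "TEMP_UP": "temperature raised",
    "TEMP_DOWN": "temperature lowered",
}


def handle_speech(speech_content: str) -> Optional[Tuple[str, str]]:
    """Return (command code, message), or None if the speech content is unknown."""
    code = COMMAND_CODES.get(speech_content)
    if code is None:
        return None
    # Count how many speech contents share this command code.
    phrases = [p for p, c in COMMAND_CODES.items() if c == code]
    message = FUNCTION_MESSAGES[code]
    if len(phrases) > 1:
        # One-to-many: echo the recognized words so the user can confirm them.
        message = f'"{speech_content}": {message}'
    return code, message
```

Echoing the recognized phrase only in the one-to-many case mirrors the stated advantage: the user can tell which of several equivalent utterances the system actually heard.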
Software code comment management system performs browsing, editing, grouping and 
updating of code comments input as voice message by programmer and establishes correlation 
between code comments and related comment objects 
Original Titles: Software code comments management method and system supporting voice 
recognition technology; Software code comments management method and system supporting 
speech recognition technology 
Alerting Abstract ..NOVELTY - A multimedia user interface 
module (100) has a speech recognition unit (102) that translates a programmer's voice messages into 
code comments. A management module (101) performs browsing, editing, grouping and updating of 
the code comments and establishes correlation between code comments and related comment objects. 
The comment data and control information are ... USE - For managing comments of software code 
using voice recognition technology ... 102 speech recognition module... 
Class Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
...G10L-0013/04... G10L-0015/20 G10L-0015/22 G10L-0015/26 G10L-0013/00 G10L-0015/00 
...G10L-0015/00 
Original Publication Data by Authority... 
Original Abstracts: ...code and compose 
voice comments. These comments, divided into help comments, test items and variable comments, 
are subsequently recorded, stored, analyzed, prescribed and displayed using text to speech and voice 
recognition software. It is therefore possible to define test cases, execute vocal follow up on 
changes, and listen to comments and variables values, while running a program ... time and run 
time. 

A code writer is hereby enabled to simultaneously write code and compose voice comments. 
These comments, divided into help comments, test items and variable comments, are subsequently 
recorded, stored, analyzed, prescribed and displayed using text to speech and voice recognition 
software. It is therefore possible to define test cases, execute vocal follow up on changes, and listen 
to comments and variables values, while running a program ... 

Claims: What is claimed is: 1. A software code comments management system operating with and 
within a software code development environment and able to interact dynamically with the code 
segments, variables and characteristics, supported by voice recognition technology comprising: 
a multimedia user interface that includes a graphic user interface that responds to voice commands; 
a comment management component, for managing the comments; 
a speech recognition module; 
a database system for storing system data and control information; and 
an output mechanism for outputting the comments or comments-code interaction results ... 

...code development environment and able to interact dynamically with the code segments, 
variables and characteristics, the system comprising: 
a multimedia user interface that includes a graphic user interface that responds to voice commands; 
a comment management component, for managing the comments; 
a speech recognition module operatively connected to said multimedia user interface; 
a database system for storing system data and control information; and 
an output mechanism for outputting the comments or comments-code interaction results; 
wherein said comment management component enables classifying comments into comments 
categories from the group comprising of: Help comments, Testing comments and Watch variable 
comments; 
wherein said testing comments are used ... 

...operating with and within a software code development environment and able to interact 
dynamically with the code segments, variables and characteristics, the system comprising: 
a multimedia user interface that includes a graphic user interface that responds to voice commands; 
a comment management component for managing a plurality of comments; 
a speech recognition module operatively connected to said multimedia user interface; 
a database system for storing system data and control information; and 
an output mechanism for outputting the comments or comments-code interaction results; 
wherein said comment management component enables classifying comments into comments 
categories from the group comprising of: Help comments, Testing comments and Watch variable 
comments; 
wherein said watch variable comments enable follow up of variables and expressions values 
including: adding to a comment at least one reference to code variables, to be displayed for tracing 
and debugging; adding a description for... 
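
The comment management component's three categories and the watch-variable follow-up can be sketched as below. The class and field names are hypothetical, speech recognition is out of scope (comments arrive as already-transcribed text), and the lookup of variable values from a plain dictionary is an assumption standing in for a real debugger.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List


class Category(Enum):
    HELP = "help"
    TESTING = "testing"
    WATCH_VARIABLE = "watch variable"


@dataclass
class Comment:
    text: str            # transcribed comment text (recognition itself is not modelled)
    category: Category
    target: str          # hypothetical identifier of the related comment object
    watched: List[str] = field(default_factory=list)  # variables referenced by a watch comment


class CommentManager:
    """Stores comments and correlates them with code objects."""

    def __init__(self) -> None:
        self._comments: List[Comment] = []

    def add(self, comment: Comment) -> None:
        self._comments.append(comment)

    def by_category(self, category: Category) -> List[Comment]:
        return [c for c in self._comments if c.category is category]

    def for_target(self, target: str) -> List[Comment]:
        return [c for c in self._comments if c.target == target]

    def watch_values(self, variables: Dict[str, object]) -> Dict[str, object]:
        """Current values of every variable referenced by a watch-variable comment."""
        names = [n for c in self.by_category(Category.WATCH_VARIABLE) for n in c.watched]
        return {n: variables[n] for n in names if n in variables}
```

A caller would pass the running program's variable snapshot to `watch_values` and display or speak the result.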
Dynamic display method for speech recognition system, involves providing single floating 
window for displaying dynamically varying frames of speech recognition system state 
information to user 



Patent Assignee: INT BUSINESS MACHINES CORP (IBMC) 
Inventor: GOMEZ F 

Patent Family ( 2 patents, 1 countries ) 



Patent Number    Kind  Date      Application Number  Kind  Date      Update  Type
US 20030088413   A1    20030508  US 2001992816       A     20011106  200353  B
US 7099829       B2    20060829  US 2001992816       A     20011106  200657  E

Priority Applications (no., kind, date): US 2001992816 A 20011106 

Patent Details 



Patent Number    Kind  Lan  Pgs  Draw  Filing Notes
US 20030088413   A1    EN   9    4





Original Titles: Method of dynamically displaying speech recognition system information 
Alerting Abstract ...NOVELTY - The method involves providing a floating window for displaying frames of 
speech recognition system state information to a user. Frames are varied according to trigger events 
detected in the speech recognition system. In response to said trigger event, the floating window is 
dynamically updated. A single graphical user interface is configured to display all context dependent 
frames. DESCRIPTION - An INDEPENDENT CLAIM is included for a single graphical user 
interface configured to display all context dependent frames of selected items of speech recognition 
system ... USE - Used for speech recognition systems ... ADVANTAGE - Dynamic display 
method reduces screen clutter arising from the use of multiple windows while still providing users 
with needed information by interactive navigation ... 100 Speech recognition system (SRS... 
...110 Speech recognition engine 
Title Terms .../Index Terms/Additional Words: DISPLAY; 
Class Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
. G10L-0011/00 G10L-0015/22 G10L-0021/00 ...G10L-0011/00 G10L-0015/00... ...G10L-0021/00 
Original Publication Data by Authority 
Original Abstracts: The present invention 
provides a method of dynamically displaying speech recognition system information. The method 
can include providing a single floating window for displaying frames of speech recognition system 
state information to a user. The frames can be varied according to trigger events detected in the 
speech recognition system. Each frame can differ from others of the frames according to the speech 
recognition system state information. 

Claims: What is claimed is: 1. A method of dynamically displaying speech recognition 
system information comprising: 
providing a single floating window for displaying frames of speech recognition system state 
information to a user; and 
varying said frames according to trigger events detected in said speech recognition system, 
wherein each said frame differs from others of said frames according to said speech recognition 
system state information. 

What is claimed is: 1. A method of dynamically displaying speech recognition system information 
comprising: 
providing a single floating window for displaying frames of speech recognition system state 
information to a user; 
performing an empirical analysis of a plurality of user actions and inputs to determine a 
plurality of different trigger events that cause operating states of the speech recognition system to 
change in response thereto; and 
dynamically updating said single floating window for individually and sequentially displaying said 
frames at different times, each individual display changing in response to said different trigger 
events detected in said speech recognition system, 
wherein each said individually displayed frame differs from others of said frames ... recognition 
system state information, 
wherein the frames variably and individually displayed in the single floating window 
include a frame containing a list of valid speech recognition commands for a current speech 
recognition system state and a frame containing a list of alternative text selections for a previously 
spoken word for which a speech-recognition operation has been performed, 
wherein each variably and individually displayed frame is uniquely associated with a specific 
trigger event, and 
wherein at least one of said trigger events is an automatic user-independent event. 
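
The claimed behavior, one floating window whose displayed frame changes with trigger events, can be sketched as a lookup from trigger event to frame. The event names and frame contents are hypothetical; only the two frame types (valid commands, alternative text selections) come from the claim.

```python
class FloatingWindow:
    """A single window that shows exactly one frame at a time."""

    def __init__(self, frames, initial_event):
        # Each frame is uniquely associated with one trigger event.
        self._frames = dict(frames)
        self.current_event = initial_event
        self.content = self._frames[initial_event]

    def on_trigger(self, event):
        """Switch to the frame for this trigger event; ignore unknown events."""
        if event in self._frames:
            self.current_event = event
            self.content = self._frames[event]
        return self.content


# Hypothetical trigger events and frame contents.
FRAMES = {
    "dictation_started": ["scratch that", "new paragraph", "stop dictation"],  # valid commands
    "correction_requested": ["their", "there", "they're"],  # alternative text selections
}

window = FloatingWindow(FRAMES, "dictation_started")
window.on_trigger("correction_requested")
```

Because every frame is rendered in the same window object, the display never grows beyond one window regardless of how many system states exist.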

13/3,K/6 (Item 6 from file: 350) 
Derwent WPIX 

(c) 2008 The Thomson Corporation. All rights reserved. 

0013422841 Drawing available 
WPI Acc no: 2003-513434/200348 
Related WPI Acc No: 2007-716894 
XRPX Acc No: N2003-407604 

Control device for enabling user to communicate with computer, has prompt unit for using 
speech input data to provide prompt to user to assist user in formulating spoken command that 
initiates use of access data 
...Original Titles: Control apparatus for enabling a user to communicate by speech with a 
processor-controlled apparatus 
Alerting Abstract ...NOVELTY - A prompt unit uses speech input data to 
provide a prompt to a user to assist the user in formulating a spoken command that initiates use of 
access data, when determined location of access data and speech input data within a screen 
corresponding to a location defined by location data. ... with prompt to assist user in formulating the 
word or words to be spoken when user directs his or her attention to a location of display screen at 
which speech input is possible... 
Class Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
G10L-021/00 Main ...G10L-0011/00 G10L-0015/00... G10L-0021/00 G10L-0021/00 G10L-0021/06 
..G10L-0011/00 G10L-0015/00 ... ...G10L-0021/00 G10L-0021/00 G10L-0021/00 
Original Publication Data by Authority 
Original Abstracts: A control apparatus is provided for enabling a user to communicate by 
speech with processor-controlled apparatus (1) having a display (44) for displaying screens to a 
user, an audio input (42) for receiving speech data from the user and a pointing device (40) for 
providing data for enabling the screen location at which the user's attention is directed to be 
determined. The control apparatus has a screen data provider (100) providing screen data for a 
display screen, the screen data including access data, location data defining a location within the 
screen associated with the access data and speech input data for assisting a user in formulating a 
spoken command or commands to initiate use of the access data; a screen data displayer (100) for 
causing the display (44) to display a screen representing the screen data to the user; a location 
determiner (100) for determining from position data provided by the position data provider the ... 

A control apparatus controls the display of text data which includes a speech link that can be 
activated by spoken command. The shape of a pointing device cursor displayed on a display is then 
changed by the apparatus when the pointing device cursor is located over the speech link included in 
displayed text data. The apparatus is to output a prompt identifying speech commands that can 
be used to activate the speech link if the pointing device cursor is displayed on a display located over 
the speech link in a changed state for a predetermined time. 

Claims: 1. Control apparatus for enabling a user to communicate by speech with processor-controlled 
apparatus having a display for displaying screens to a user, an audio input means for receiving 
speech data from the user and a position data providing means for providing data for enabling the 
screen location at which the user's attention is directed to be determined, the control apparatus 
comprising: 
screen data providing means providing screen data for a display screen, the screen data including 
access data, location data defining a location within the screen associated with the access data and 
speech input data for enabling a user to formulate a spoken command or commands to initiate use of 
the access data; 
screen data display control means for causing the display to display a screen representing the 
screen data to the user; 
location determining means for determining from position data provided by the position data 
providing means the ... 

The invention claimed is: 1. A control apparatus for enabling a user to communicate by speech with 
a processor-controlled apparatus, comprising: 
display control means for controlling a display of text data which includes a speech link that can 
be activated by a spoken command; 
cursor control means for changing a shape of a pointing device cursor displayed on a 
display when the pointing device cursor is located over the speech link; and 
prompt output means for outputting a prompt identifying speech commands that can be used to 
activate the speech link when the pointing device cursor is displayed on the display in a changed 
state for a predetermined time located over the speech link. 
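
The cursor and prompt behavior in the second claim can be sketched as a small hover tracker. The rectangle-based hit test, the cursor shape names, the 1.5 second dwell time and the prompt wording are all assumptions; timestamps are passed in explicitly so the logic stays independent of any GUI toolkit.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class SpeechLink:
    x0: int
    y0: int
    x1: int
    y1: int
    commands: List[str]  # spoken commands that activate this link

    def contains(self, x: int, y: int) -> bool:
        return self.x0 <= x <= self.x1 and self.y0 <= y <= self.y1


class CursorController:
    def __init__(self, links: List[SpeechLink], dwell_seconds: float = 1.5):
        self._links = links
        self._dwell = dwell_seconds  # the "predetermined time" (assumed value)
        self._hover: Optional[SpeechLink] = None
        self._since = 0.0

    def update(self, x: int, y: int, now: float) -> Tuple[str, Optional[str]]:
        """Return (cursor shape, prompt or None) for the cursor position at time now."""
        link = next((l for l in self._links if l.contains(x, y)), None)
        if link is None:
            self._hover = None
            return ("arrow", None)
        if self._hover is not link:
            # Cursor has just moved onto this link: restart the dwell timer.
            self._hover = link
            self._since = now
        prompt = None
        if now - self._since >= self._dwell:
            prompt = "Say: " + " or ".join(link.commands)
        return ("speech", prompt)
```

Calling `update` on every pointer movement or timer tick yields the changed cursor immediately and the prompt only after the cursor has rested on the link for the dwell time.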
Speech input system for searching information using portable terminal, has speech portal 
server that recognizes speech input from portable terminal as common and object text for 
searching information 
Alerting Abstract ...NOVELTY - A speech portal server (50) recognizes 
the speech input from a portable terminal (10) as text. The recognized text is separated into 
command text and object text. An application service provider (60) searches the information based on 
the separated command text and object text received from the speech portal server, and provides ... 
ADVANTAGE - Optimal information is provided to the user by providing speech portal server for 
recognizing input speech for searching information from application service provider... 
Class Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
G10L-011/00 G10L-021/00 Main ...G10L-0015/00 G10L-0015/00 G10L-0015/22 G10L-0015/28 
G10L-0021/00 . G10L-0015/00 G10L-0015/00 G10L-0021/00 G10L-0021/00 
Original Publication Data by Authority... 
Original Abstracts: ...through speech, and receiving 
a service from a provider for providing information, the speech input system comprises speech 
input/output mean, and an access status display mean, a speech portal server (50) provided with a 
speech recognizing mean to recognize it as a text, a command converting mean for checking the 
recognized text with a command text dictionary, and separating it into a command text and an 
object text, and a conversation control mean for having an access to, and receiving a service from a 
provider which provides different information based on the separated ... 

...information, and telephone information, the speech input system comprises speech input 
terminals 10, 30 provided with a speech input/output mean, and an access status display mean, a 
speech portal server 50 provided with a speech recognizing mean for receiving a speech to 
recognize it as a text, a command converting mean for checking the recognized text with a command 
text dictionary, and separating it into a command text and an object text, and a conversation 
control mean for having an access to, and receiving a service from a provider which provides 
different information based on the separated... 

Claims: A speech 
input system comprising:a speech input terminal provided with a speech input/output mean, a Web 
browser, and a display mean for displaying an access status to an external system and a search 
result;a speech portal server provided with a speech recognizing mean for receiving a speech from 
said speech input terminal to recognize it as a text, a command converting mean for checking the 
recognized text with a command text dictionary, and separating it into a command text and an 
object text, and a conversation control mean; andthe application service provider which is provided 

with an information search mean for searching information based on the command is claimed 

is;l. A speech input system comprising:a speech input terminal provided with a speech input/output 
mean, a Web browser, and a display mean for displaying an access status to an external system and a 
search result;a speech portal server provided with a speech recognizing mean for receiving a speech 
from said speech input terminal to recognize it as a text, a command converting mean for checking 
the recognized text with a command text dictionary, and separating it into a command text and an 
object text, and a conversation control mean; andthe application service provider which is provided 

with an information search mean for searching information based on the command is claimed 

is:l. A speech input system comprising: a speech input terminal provided with a speech input/output 



mean, a Web browser, and a display mean for displaying an access status to an external system and a
search result; a speech portal server provided with a speech recognizing mean for receiving a speech
from said speech input terminal to recognize it as a text, a command converting mean for checking
the recognized text with a command text dictionary, and separating it into a command text and an
object text, and a conversation control mean; and the application service provider which is provided
with an information search mean for searching information based on the command... conversation
between said speech input terminal and said application service provider based on the provided
speech; wherein said speech portal server is provided with speech recognizing means for receiving
speech from said speech input terminal to recognize it as a text, command converting means for
checking the recognized text with a command text dictionary, extracting a command text from the
recognized text, and determining the text other than the extracted command text as an object text
or conducting a morpheme analysis to remove postposition from the text other than the extracted
command text and to determine it as the object text, and conversation control means for sending the
separated command text and said object text to said application service provider, and... a speech
input/output mean, a Web browser, and a display mean for displaying an access status to an external
system and a search result; a speech portal server provided with a speech recognizing mean for
receiving a
speech from said speech input terminal to recognize it as a text, a command converting mean for
checking the recognized text with a command text dictionary, and separating it into a command text
and an object text, and a conversation control mean; and the application service provider which is
provided with an information search mean for searching information based on the command...
different information; and a speech portal server which controls a conversation between said speech
input terminal and said application service provider based on the provided speech, and is provided 
with a speech recognizing means for receiving a speech from said speech input terminal to 
recognize it as a text comprising character strings, a command converting means for checking the 
recognized text with a command text dictionary, and separating it into a command text and an 
object text, and a conversation control means for sending the separated command text and object text 
to said application service provider, and providing said speech input terminal with information 
searched by said application service provider; wherein said speech recognizing means is provided
with a connected speech recognizing means for recognizing speech sound in a sentence constructed
with a succession of a plurality of words in a relatively longer utterance, a word speech recognizing
means for recognizing speech sound in a relatively shorter utterance and a comprehensive
recognition evaluating means for selecting either of recognition results of said two recognizing
means based on a speech characteristic value provided as a threshold... conversation between said
speech input terminal and said application service provider based on the provided speech; wherein
said speech portal server is provided with speech recognizing means for receiving speech from said
speech input terminal to recognize it as a text, command converting means for checking the
recognized text with a command text dictionary, extracting a command text from the recognized
text, and determining the text other than the extracted command text as an object text or
conducting a morpheme analysis to remove postposition from the text other than the extracted
command text and to determine it as the object text, and conversation control means for sending the
separated command text and said object text to said application service provider, and... is claimed
is: 1. A speech input system comprising: a speech input terminal provided with a speech input/output
mean, a Web browser, and a display mean for displaying an access status to an external system and a
search result; a speech portal server provided with a speech recognizing mean for receiving a speech
from said speech input terminal to recognize it as a text, a command converting mean for checking
the recognized text with a command text dictionary, and separating it into a command text and an
object text, and a conversation control mean; and an application service provider which is provided
with an information search mean for searching information based on the command...
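
The command converting step recited in the claims above (checking the recognized text against a command text dictionary, extracting the command text, and treating the remaining text as the object text) can be illustrated with a short sketch. This is only an illustration under stated assumptions, not the patented implementation: the function name `split_command`, the sample dictionary entries, and the word-level matching are hypothetical, and the morpheme analysis that removes postpositions is left out.

```python
def split_command(recognized_text, command_dictionary):
    """Split recognized text into (command_text, object_text).

    Hypothetical helper: returns (None, recognized_text) when no
    dictionary entry occurs in the recognized text.
    """
    words = recognized_text.split()
    lowered = [w.lower() for w in words]
    # Try longer entries first so "search for" wins over "search".
    entries = sorted(command_dictionary, key=lambda e: len(e.split()), reverse=True)
    for entry in entries:
        entry_words = entry.lower().split()
        n = len(entry_words)
        if n == 0:
            continue  # ignore empty dictionary entries
        for i in range(len(words) - n + 1):
            if lowered[i:i + n] == entry_words:
                # Everything other than the command text is the object text.
                object_words = words[:i] + words[i + n:]
                return entry, " ".join(object_words)
    return None, recognized_text


# Assumed example dictionary and utterance, for illustration only.
command, obj = split_command("search for Italian restaurants",
                             ["search", "search for", "play"])
print(command, "|", obj)
```

In the claimed system, the resulting pair would be what the conversation control means sends on to the application service provider.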

Voice controlled computer interface for manipulating pointing device, digitizes and processes
user utterances, to enable recognizer software to generate command string for operating
system manipulation
Alerting Abstract ...NOVELTY - A voice navigator box (112) receives user utterances through a
microphone (114) and digitizes and processes the signals such that the digital samples conform to
small computer systems interface (SCSI) protocols. A recognizer software (120) runs on the
operating system (132) to process the digitized signals to generate a command string (121)
representing commands for actions to be taken or operating system manipulation. ... method for
converting voice utterances to commands; method for generating a table for aiding conversion of
voice utterances; and method for enabling a user to create an instance in a formal language having
strictly defined syntax... device, to control motion of a cursor on the computer screen and to
indicate desired actions associated with the position of the cursor on the display. A user can easily
control the graphical interface and can switch between voice utterances that correspond to
commands for actions to be taken, and voice utterances that correspond to text strings to be... Class
Codes International Patent Classification IPC Class Level Scope Position Status Version Date
...G10L-0015/26 ...G10L-0015/00 Original Publication Data by Authority Original Abstracts:
Voice utterances are substituted for manipulation of a pointing device of the kind which is 
manipulated to control motion of a cursor on a computer display and to indicate desired actions 
associated with the position of the cursor on the display. Voiced utterances are converted to 
commands, expressed in a predefined command language, to be used by an operating system of a 
computer, by converting some voiced utterances into commands corresponding to actions to be taken 
by the operating system, and converting other voiced utterances into commands which carry 
associated text strings to be used as part of text being processed in an application program running 
under the operating system. A table is generated for aiding the conversion of voiced utterances to 
commands for use in controlling an operating system of a computer to achieve desired actions in an 
application program running under the operating system. ...Claims: for manipulation of a pointing 
device, the pointing device being of the kind which is manipulated to control motion of a cursor on a 
computer display and to indicate desired actions associated with the position of the cursor on the 
display, the cursor being moved and the desired actions being aided by an operating system in the 

computer in response to control signals received from the pointing device, the computer system 

being separately responsive to control signals received from the keyboard in accordance with a 
predetermined format specific to the keyboard, the system comprising a voice recognizer for 
recognizing a voiced utterance, and an interpreter for converting the voiced utterance into 
control signals which will directly create a desired action aided by the operating system in the 
computer without first being converted into control signals expressed in the predetermined format 
specific to the... 
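
The abstract above distinguishes utterances that map to actions from utterances that carry text strings, with a table aiding the conversion. A minimal sketch of such a table lookup follows. The table contents, the command-language syntax (`MENU`, `SCROLL`, `TEXT`), and the `dictation` flag are all assumptions made for illustration; the record does not give the actual command language.

```python
# Hypothetical utterance table: spoken phrase -> command in an assumed command language.
UTTERANCE_TABLE = {
    "open file": "MENU File Open",
    "scroll down": "SCROLL DOWN 1",
    "click": "MOUSE CLICK LEFT",
}


def utterance_to_command(utterance, table=UTTERANCE_TABLE, dictation=False):
    """Convert one recognized utterance into a command string."""
    key = " ".join(utterance.lower().split())  # normalize case and spacing
    if not dictation and key in table:
        # The utterance corresponds to an action to be taken.
        return table[key]
    # Otherwise the utterance carries a text string for the application.
    return 'TEXT "' + utterance.replace('"', '\\"') + '"'


print(utterance_to_command("Open  File"))
print(utterance_to_command("dear sir"))
```

The `dictation` flag stands in for the user's ability to switch between command utterances and text utterances; how that switch is actually triggered is not stated in the record.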

Control system for telephony system, reproduces stored message including identifier for
identifying caller, using recognition result generated by speech recognition user interface
Alerting Abstract ...NOVELTY - A control unit controls telephony system so as to reproduce a
stored message, using the recognition result generated by a speech recognition user interface. The
message is stored with an identifier, for identifying caller. The control unit displays a list of messages
to be reproduced upon the instruction to reproduce identified message. ... USE - For controlling
telephony system using speech recognition system such as facsimile, telephone/printer, photocopier
and human/machine interface... Class Codes International Patent Classification IPC Class Level
Scope Position Status Version Date G10L-0015/12 G10L-0015/12 G10L-0015/12 G10L-0015/22
G10L-0015/22 G10L-0015/22 G10L-0015/00 G10L-0015/00 G10L-0015/00 G10L-0015/00
Original Publication Data by Authority ...Original Abstracts: by

processing the reference patterns on the active list, and a new active list 205 is generated for the 
succeeding input pattern. The amount of processing required for each pattern on the active list is 
reduced by using a pointer which identifies the reference pattern which is the earliest in the sequence 
of patterns of the current reference signal listed on the new active list during the processing of a 
preceding dynamic programming path. In a second aspect, a speech recognition interface is used as 
a control system for a telephony system. Claims: A control system for controlling a telephony 
system, comprising: a speech recognition user interface for allowing a user to input speech 
commands for controlling the telephony system, said speech recognition user interface comprising: 
(i) means for receiving an input speech command; (ii) means for storing a plurality of reference word 
models; (iii) means for comparing the input speech command with the stored reference word models 
to generate a recognition result; and control means, responsive to the recognition result generated 
by said speech recognition user interface, for controlling the telephony system in accordance with 
the input speech command; wherein said control means controls the telephony system so as to 
reproduce a stored message using the recognition result generated by said speech recognition user 
interface; wherein said message is stored with an identifier for identifying the caller that has left the 
message; and wherein said control means is operable to display a list of messages to be reproduced
upon instruction to reproduce the message identified by said identifier... A control system for
controlling a telephony system, comprising: a speech recognition user interface (321) for allowing a
user to input speech commands for controlling the telephony system, said speech recognition user
interface comprising: (i) means (321) for receiving an input speech command; (ii) means (321) for
storing a plurality of reference word models; (iii) means (321) for comparing the input speech
command with the stored reference word models to generate a recognition result; and control
means (325, 331), responsive to the recognition result generated by said speech recognition user 
interface (321), for controlling the telephony system in accordance with the input speech command; 
wherein said control means (325, 331) is operable to control the telephony system so as to reproduce 
a stored message using the recognition result generated by said speech recognition user interface 
(321); wherein said message is stored with an identifier for identifying the caller that has left the 
message; and wherein said control means (325, 331) is operable to display a list of messages to be 
reproduced based on the recognition result generated by said speech recognition user interface 
(321) and the identifier, whereby the user is able to request, via said speech recognition user 
interface, the control means to replay messages from a particular caller... 
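
The claim above stores each message with an identifier of the caller who left it and uses the recognition result to list the messages from a particular caller. A minimal sketch of that lookup is given below. The `StoredMessage` class, the `caller_directory` mapping from a recognized name to a caller identifier, and the sample data are hypothetical and not taken from the record.

```python
from dataclasses import dataclass


@dataclass
class StoredMessage:
    caller_id: str   # identifier of the caller who left the message
    recording: str   # placeholder for the stored audio


def messages_for_caller(recognition_result, caller_directory, mailbox):
    """Return the messages to list for replay, given a recognized caller name."""
    caller_id = caller_directory.get(recognition_result.lower())
    if caller_id is None:
        return []  # the recognized word does not name a known caller
    return [m for m in mailbox if m.caller_id == caller_id]


# Assumed sample data, for illustration only.
mailbox = [
    StoredMessage("555-0101", "a.wav"),
    StoredMessage("555-0102", "b.wav"),
    StoredMessage("555-0101", "c.wav"),
]
directory = {"alice": "555-0101"}
print([m.recording for m in messages_for_caller("Alice", directory, mailbox)])
```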

Audio browsing system has converter that transforms command into page address so that
analyzed user's voice can be displayed on browser based on converted page address
...NOVELTY - The system has a microphone (22) through which user's voice is input. A controller
(12) has an analyzer (12) for processing the input voice of the user as a command. A converter (52)
transforms the command into a page address so that the analyzed user's voice can be displayed on a
browser based on the converted page address. Title Terms .../Index Terms/Additional Words:
DISPLAY; Class Codes International Patent Classification IPC Class Level Scope Position Status

control signal to directly produce desired action by operating system in computer
Alerting Abstract ...NOVELTY - A voice recognizer is provided to recognize a voiced utterance.
An interpreter (130) converts the voiced utterance into control signal to directly produce a desired
action by an operating system in a computer without being converted into control signals expressed
in a... a voiced utterance converting method; and a voice utterance table generating
method. Class Codes International Patent Classification IPC Class Level Scope Position Status
Version Date G10L-0015/26 ...G10L-0015/00 Original Publication Data by Authority Original
Abstracts: Voice utterances are substituted for manipulation of a pointing device of the kind which is 
manipulated to control motion of a cursor on a computer display and to indicate desired actions 
associated with the position of the cursor on the display. Voiced utterances are converted to 
commands, expressed in a predefined command language, to be used by an operating system of a
computer, by converting some voiced utterances into commands corresponding to actions to be taken 
by the operating system, and converting other voiced utterances into commands which carry 
associated text strings to be used as part of text being processed in an application program running 
under the operating system. A table is generated for aiding the conversion of voiced utterances to 
commands for use in controlling an operating system of a computer to achieve desired actions in an 
application program running under the operating system. ...Claims: for manipulation of a pointing
device, the pointing device being of the kind which is manipulated to control motion of a cursor on a
computer display and to indicate desired actions associated with the position of the cursor on the
display, the cursor being moved and the desired actions being aided by an operating system in the
computer in response to control signals received from the pointing device, the computer system
being separately responsive to control signals received from the keyboard in accordance with a
predetermined format specific to the keyboard, the system comprising a voice recognizer for
recognizing a voiced utterance, and an interpreter for converting the voiced utterance into control
signals which will directly create a desired action aided by the operating system in the computer
without first being converted into control signals expressed in the predetermined format specific to
the...

Voice control of server stored page capable of being downloaded for display on clients
terminal, server has dictionaries associated with page download, client identifies relevant
dictionary(s) and loads it/them into his terminal Original Titles: VOICE-OPERATED
SYSTEM FOR CONTROLLING A PAGE STORED ON A SERVER AND CAPABLE OF BEING 
DOWNLOADED FOR DISPLAY ON A CLIENT DEVICE ... Alerting Abstract ...NOVELTY - 
The server memory (1), holding the page, contains a dictionary for one or more vocal links including 
for each link an audio voice recording for voice control. The client terminal is designed to download 
into memory each page associated dictionary. The voice recognition system on the clients terminal 

(2) includes a program (E) which compares the audio recording corresponding to the voice command 
with the recording(s) of each associated page dictionary. ...terminal (2) which can exchange data with 
a remote server (1) via a telecommunication network (3). The system includes means (11,12) for 
recording a voice command given by the user, and voice recognition means which enable a recorded 
voice command to automatically determine and control the execution of an action associated with 
the voice command. The voice link has several voice command audio recordings one at least of 
which is a female voice recording and male voice recording. The server holds data identifying the 
associated dictionary(s). The client terminal detects the data at the moment of page display and when 
detected sends a request to the server so that the latter sends back to it the dictionary identified by the 
data... 7 processing and navigating unit E speech recognition Title Terms .../Index
Terms/Additional Words: DISPLAY; Class Codes International Patent Classification IPC Class
Level Scope Position Status Version Date ...G10L-0015/22 ...G10L-0015/00 Original Publication
Data by Authority... Original Abstracts: with a remote server (1) via a telecommunications network
(3) and which, on the other hand, includes means (11, 12) permitting the recording of a voice
command spoken by a user, and voice recognition means making it possible, from a recorded voice
command, to determine and command automatically the execution of an action associated with
this voice command. The server (1) has in its memory, in association with said page (page (1).htm,
...), at least one dictionary (page (1).ias, ...) of one or more voice links including for each voice
link at least one audio recording of the voice command; the client device is capable of downloading into
its memory each dictionary associated with the page, and the voice recognition means of the client 
device (2) comprise a voice recognition program (E) that is designed to make a comparison of the 
audio recording corresponding to the voice command with the audio recording or recordings of 

each dictionary associated with the page... voice-operated control of a page designed to be
displayed on a client device (2), which can exchange data with a remote server (1) via a
telecommunication network (3), and which comprises means (11, 12) for recording a voice command
spoken by a user, and speech recognition means enabling, from a recorded voice command, to
determine and automatically control the execution of an action associated with said voice command.
The server (1) comprises in storage, connected with said page (page (1).htm), at least a dictionary
(page (1).ias) of one or several voice links comprising for each voice link at least an audio recording
of the voice command; the client device is adapted to download in storage each dictionary associated
with the page, and the speech recognition means of the client device (2) comprise a speech
recognition programme (E) which is designed to carry out a comparison of the audio recording
corresponding to the voice command with the audio recording(s) of each dictionary associated with
the page... means (11, 12) permitting the recording of a voice command spoken by a
user, and voice recognition means making it possible, from a recorded voice command,
to determine and command automatically the execution of an action associated with
this voice command. The server (1) has in memory, linked to said page (page (1).htm,
...), at least one dictionary (page (1).ias, ...) of one or more voice links comprising for each
voice link at least one audio recording of the voice command; the client device is able to
download into memory... Claims: hand, can exchange data with a remote server via a
telecommunications network and which, on the other hand, includes means permitting the recording 
of a voice command spoken by a user, and voice recognition means making it possible, from a 
recorded voice command, to determine and command automatically the execution of an action 
associated with this voice command, characterised in that the server has in its memory, in 

association with said page, at least one dictionary of one or more voice links including the voice 

command, in that the client device is capable of downloading into its memory each dictionary 
associated with the page, and in that the voice recognition means of the client device comprise a 
voice recognition program that is designed to make a comparison of the audio recording 
corresponding to the voice command with the audio recording or recordings of each dictionary 
associated with the page. 
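
The record above describes a client that downloads the dictionary associated with a page and compares the recorded voice command with the audio recordings stored for each voice link. The sketch below illustrates only the selection logic. It assumes, purely for illustration, that every recording has already been reduced to a fixed-length feature vector and that Euclidean distance is an acceptable similarity measure; the record does not say how the comparison program (E) actually compares recordings, and the names `match_voice_link` and `max_distance` are hypothetical.

```python
import math


def match_voice_link(command_features, page_dictionary, max_distance=1.0):
    """Return the action of the closest voice link, or None if nothing is close.

    page_dictionary maps an action (for example a page address) to a list of
    reference recordings, each reduced here to a fixed-length feature vector.
    """
    best_action, best_distance = None, float("inf")
    for action, recordings in page_dictionary.items():
        for reference in recordings:  # e.g. one female and one male recording
            distance = math.dist(command_features, reference)
            if distance < best_distance:
                best_action, best_distance = action, distance
    return best_action if best_distance <= max_distance else None


# Assumed toy dictionary with two-dimensional features, for illustration only.
page_dictionary = {
    "next.htm": [[0.0, 0.0], [0.2, 0.1]],
    "home.htm": [[1.0, 1.0]],
}
print(match_voice_link([0.9, 1.1], page_dictionary))
```

Storing several recordings per link, as the abstract suggests with female and male voices, simply gives the matcher more reference points for the same action.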

Voice activated navigation system for a vehicle can be operated using different languages

Patent Assignee: ALPINE ELECTRONICS INC (ALPN); ALPINE KK (ALPN); SATOH K (SATO-I)

Inventor: SATO K; SATOH K

Patent Family (8 patents, 28 countries)

Patent Number    Kind  Date      Application Number  Kind  Date      Update  Type
EP 1146316       A2    20011017  EP 2001302788       A     20010326  200226  B
JP 2001296882    A     20011026  JP 2000114244       A     20000414  200226  E
US 20010037203   A1    20011101  US 2001832645       A     20010411  200226  E
EP 1146316       B1    20050413  EP 2001302788       A     20010326  200525  E
DE 60109979      E     20050519  DE 60109979         A     20010326  200535  E
                                 EP 2001302788       A     20010326
DE 60109979      T2    20060223  DE 60109979         A     20010326  200615  E
                                 EP 2001302788       A     20010326
JP 3908437       B2    20070425  JP 2000114244       A     20000414  200730  E
US 7277846       B2    20071002  US 2001832645       A     20010411  200765  E


Priority Applications (no., kind, date): JP 2000114244 A 20000414; EP 2001302788 A 20010326 



Patent Details

Patent Number    Kind  Lan  Pgs  Draw  Filing Notes
EP 1146316       A2    EN   3    2
  Regional Designated States, Original: AL AT BE CH CY DE DK ES FI FR GB GR IE IT LI LT LU LV MC MK NL PT RO SE SI TR
JP 2001296882    A     JA   12
EP 1146316       B1    EN
  Regional Designated States, Original: DE FR GB
DE 60109979      E     DE              Application EP 2001302788
                                       Based on OPI patent EP 1146316
DE 60109979      T2    DE              Application EP 2001302788
                                       Based on OPI patent EP 1146316
JP 3908437       B2    JA   16         Previously issued patent JP 2001296882



Alerting Abstract ...NOVELTY - A speech recognition unit (2) is used together with a language 
determining unit (3) to determine the language used by a speaker. The navigation system can 
display map information of the vicinity of the vehicle. Even when two languages are mixed, the unit 
can operate using the more frequently used language. The guiding speech from the unit will be 

generated using this language. ...2 speech recognition 3 speech recognition dictionary storage 

unit Class Codes International Patent Classification IPC Class Level Scope Position Status Version
Date G10L-015/00 ...G10L-0013/00 G10L-0015/00 G10L-0015/00 G10L-0015/00 ...
G10L-0015/10 G10L-0015/10 G10L-0015/28 ...G10L-0013/00 G10L-0015/00 ...
G10L-0015/00 G10L-0015/00 Original Publication Data by Authority Original Abstracts: A
navigation system characterized by including speech-recognition devices 2 and 3 for performing
speech-recognition processing on input speech, language determining devices 4 and 5 for
determining a language used by a speaker of this input speech based on the contents of the input
speech recognized by the speech-recognition devices 2 and 3, and navigation processing devices
6, 7, 8, 10, 11, 12, and 13 for performing a navigation operation corresponding to the language
determined by the language determining devices 4 and 5... A navigation system includes a

speech-recognition unit for performing speech-recognition processing on input speech. The system 
also includes a language-determining unit for determining the language of a speaker based on the 
contents of the input speech as recognized by the speech-recognition unit. The system further
includes a navigation-processing unit for performing a vehicle-installed-type navigation operation
corresponding to the language as determined by the language-determining unit... A navigation
system includes a speech-recognition unit for performing speech-recognition processing on input
speech. The system also includes a language-determining unit for determining the language of a 



speaker based on the contents of the input speech as recognized by the speech-recognition unit. 
The system further includes a navigation-processing unit for performing a vehicle-installed-type 
navigation operation corresponding to the language as determined by the language-determining unit. 
Claims: A navigation system characterized by comprising speech-recognition means 2 and 3 for
performing speech-recognition processing on input speech; language determining means 4 and 5 for
determining a language used by a speaker of this input speech based on the contents of said input
speech recognized by said speech-recognition means 2 and 3; and navigation processing means 6,
7, 8, 10, 11, 12, and 13 for performing a navigation operation corresponding to said language
determined by said language determining means 4 and 5... A navigation system
comprising: speech-recognition means (2, 3) for performing speech-recognition processing on
input speech; characterized by language determining means (4, 5) for determining a language used
by a speaker of this input speech based on the contents of said input speech recognized by said
speech-recognition means (2, 3); and navigation processing means (6, 7, 8, 10, 11, 12, 13) for
performing a navigation operation corresponding to said language determined by said language
determining means (4, 5)... What is claimed is: 1. A navigation system comprising: speech-
recognition means for performing speech-recognition processing on input speech spoken by a
speaker; language-determining means for determining what language said input speech is spoken in
based on the contents of said input speech as recognized by said speech-recognition means;
and navigation-processing means for performing a vehicle-installed-type navigation operation
utilizing the language of a speaker as determined by said language-determining means... What is
claimed is: 1. A navigation system comprising: speech-recognition means for performing speech-
recognition processing on input speech spoken by a speaker; language-determining means for
determining a language in which said input speech is spoken based on the contents of said input
speech as recognized by said speech-recognition means; navigation-processing means for
performing a vehicle-installed-type navigation operation utilizing the language as determined by
said language-determining means; image recognition means for determining the contents of the
characters included in an inputted image of a road sign; and an intersection guiding means for
storing intersection records, generating an intersection guiding image corresponding to the specified
language, and generating guiding speech data corresponding to the specified language; wherein each
intersection record includes a plurality of intersection name strings in different languages;
and wherein said navigation-processing means includes a guiding means for replacing the
characters, whose contents are determined by said image recognition means, with other characters 
having substantially the same meaning in the language determined by said language-determining 
means, and for displaying characters or outputting speech. 
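
The abstract for this record states that the language is determined from the contents of the recognized speech and that, when two languages are mixed, the more frequently used one is selected. A minimal sketch of that majority rule follows. The per-word language lexicon, the function name `determine_language`, and the sample words are assumptions for illustration; the record does not describe how the recognition dictionary tags words with a language.

```python
from collections import Counter


def determine_language(recognized_words, word_language, default=None):
    """Pick the language that accounts for the most recognized words."""
    counts = Counter(
        word_language[w.lower()]
        for w in recognized_words
        if w.lower() in word_language
    )
    if not counts:
        return default  # nothing recognized: fall back to a preset language
    return counts.most_common(1)[0][0]


# Assumed toy lexicon mapping recognized words to a language code.
lexicon = {"nearest": "en", "station": "en", "eki": "ja"}
print(determine_language(["nearest", "eki", "station"], lexicon))
```

The selected language would then drive both the guiding speech and the displayed text of the navigation operation, as the claims describe.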

Speech recognition method for activating a hyperlink of text information on Internet pages
displayed to user, by determining respective phoneme sequences of corresponding determined
hypertext of the hyperlink in text information Original Titles: SPEECH RECOGNITION
METHOD FOR ACTIVATING A HYPERLINK OF AN INTERNET PAGE... Speech
recognition method for activating a hyperlink of an internet page... SPEECH RECOGNITION
METHOD FOR ACTIVATING A HYPERLINK OF AN INTERNET PAGE Alerting Abstract
DESCRIPTION - Hypertexts (HT(HL)) of hyperlinks (HL) are determined in the text information 
(TI), and corresponding phoneme sequences of the hypertexts. A spoken command is received from 
user which represents part of determined hypertext. A second phoneme sequence corresponding to 
spoken command is determined and hyperlink selected by user is determined by comparing the 
second and first phoneme sequences to activate selected hyperlink. One quality value is determined
for each hypertext. AN INDEPENDENT CLAIM is made... A speech recognition device for
controlling an Internet browser, which is arranged for receiving text information containing a
hyperlink of Internet page; A computer program product, which can be loaded directly into the
internal memory of a digital computer and containing five software code... USE - For activating a
hyperlink of an Internet page by speech recognition method or apparatus. ADVANTAGE - Invention
provides improved speech recognition method and device and an improved computer program
product Title Terms .../Index Terms/Additional Words: DISPLAY; Class Codes International Patent
Classification IPC Class Level Scope Position Status Version Date G10L-015/28 Main ...G10L-0015/00
G10L-0015/06 G10L-0015/08 G10L-0015/22 G10L-0015/28 G10L-0015/00
Original Publication Data by Authority Original Abstracts: In a speech recognition method
for activating a hyperlink (HL) of text information (TI) of an Internet page (8) displayed for a user, 
the method comprising the following steps: determining hypertexts (HT(HL)) of the hyperlinks (HL) 
in the text information (TI) shown and determining corresponding first phoneme sequences 
(PI1 [HT(HL)]) of these hypertexts (HT(HL)) and receiving a spoken command (AS) from a user 
which represents at least part of a determined hypertext (HT(HL)), and determining a second 
phoneme sequence (PI2) corresponding to the spoken command (AS) and determining the 
hyperlink (HL) selected by the user by comparison of the determined second phoneme sequence 
(PI2) with determined first phoneme sequences (PI1 [HT(HL)]) and activating the selected hyperlink
(HL), where one quality value (QV) is determined ... A speech recognition method is disclosed for
activating a hyperlink of an Internet page. In particular the method comprises determining hypertexts
of the hyperlinks in text information, determining corresponding first phoneme sequences of 
hypertexts, receiving a spoken command from a user, determining a second phoneme sequence 
corresponding to the spoken command, determining the hyperlink selected by the user using the 
first and second phoneme sequences, activating the selected hyperlink, where one quality value is
determined for each hypertext when first phoneme sequences are determined and where ... In a
speech recognition method for activating a hyperlink (HL) of text information (TI) of an Internet
page (8) displayed for a user, the method comprising the following steps: determining hypertexts 
(HT(HL)) of the hyperlinks (HL) in the text information (TI) shown and determining corresponding 
first phoneme sequences (PI1 [HT(HL)]) of these hypertexts (HT(HL)) and receiving a spoken 
command (AS) from a user which represents at least part of a determined hypertext (HT(HL)), and 
determining a second phoneme sequence (PI2) corresponding to the spoken command (AS) and 
determining the hyperlink (HL) selected by the user by comparison of the determined second 
phoneme sequence (PI2) with determined first phoneme sequences (PI1[HT(HL)]) and activating 
the selected hyperlink (HL), where one quality value (QV) is determined for each hypertext 

(HT(HL)) when ... text information (TI) presented and determining the corresponding first
phoneme sequences (PI1[HT(HL)]) of these hypertexts (HT(HL)) and receiving
a spoken command (CV) uttered by a user which represents at least part of a
determined hypertext (HT(HL)), and determining a second phoneme sequence (PI2)
corresponding to the spoken command (CV), followed by determining the hyperlink (HL)
selected by the user resulting from the comparison of the determined second phoneme sequence
(PI2) with the determined first phoneme sequence (PI1[HT(HL)]) and activating
the selected hyperlink (HL), in which a quality value... Claims: We claim: 1. A speech
recognition method for activating a hyperlink of text information of an Internet page displayed to a 
user, the method comprising the following steps: determining hypertexts of the hyperlinks in the text 
information shown and determining corresponding first phoneme sequences of these hypertexts and 
receiving a spoken command from a user which represents at least part of a determined hypertext, 
and determining a second phoneme sequence corresponding to the spoken command and 
determining the hyperlink selected by the user by comparison of the determined second phoneme 
sequence with determined first phoneme sequences and activating the selected hyperlink, wherein 
one quality value is determined for each hypertext when first phoneme sequences corresponding to 
the hypertexts... 
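
The comparison step described in this record (first phoneme sequences for each hypertext, a second phoneme sequence for the spoken command, selection by comparison) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: `to_phonemes` is a toy stand-in that treats letters as phonemes, `similarity` uses `difflib.SequenceMatcher` rather than any comparison named in the record, and the `threshold` parameter is hypothetical. The per-hypertext quality value (QV) mentioned in the abstract is not modelled here.

```python
from difflib import SequenceMatcher


def to_phonemes(text):
    """Toy stand-in for a grapheme-to-phoneme step (assumption): letters only."""
    return [ch for ch in text.lower() if ch.isalpha()]


def similarity(first_seq, second_seq):
    """Similarity in [0, 1] between two phoneme sequences."""
    return SequenceMatcher(None, first_seq, second_seq).ratio()


def select_hyperlink(hyperlinks, spoken_command, threshold=0.6):
    """Return the target of the hypertext closest to the spoken command.

    hyperlinks maps hypertext (the visible link text) to a link target.
    Returns None when no hypertext is similar enough.
    """
    second_seq = to_phonemes(spoken_command)
    best_target, best_score = None, 0.0
    for hypertext, target in hyperlinks.items():
        first_seq = to_phonemes(hypertext)
        score = similarity(first_seq, second_seq)
        if score > best_score:
            best_target, best_score = target, score
    return best_target if best_score >= threshold else None


if __name__ == "__main__":
    links = {"Weather forecast": "/weather", "Sports news": "/sports"}
    print(select_hyperlink(links, "sportz news"))
```

A real system would replace `to_phonemes` with the output of a recognizer and a pronunciation lexicon; the selection loop is the only part this sketch is meant to show.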
Data processing system for enhancing human communication in hospital, communicates fields
of translational records of language partition column, to user in accordance with client controls
Alerting Abstract ...processing mode. Fields of another set of records of language partition column
referenced by host partition index are communicated to user based on user invoked display buttons 
(116,118,120,122,124,126,128,130,132,134,136,138). DESCRIPTION - The data processing system 
has a set of client controls for navigating the user interface, display buttons 
(116,118,120,122,124,126,128,130,132,134,136,138), tabular translation rules unit, mode control 
unit, client media control unit and host media control unit. Reference of each record is associated to 
one of display buttons which are invoked by user. Each display button is an individual visible 
mechanical button distinguishable by physical touch without inadvertent invocation. The button 
includes a dedicated display unit integrally provided for dynamically displaying a number of 
purposes for manually invoking button. The tabular translation rules unit contains a number of 

translation member has a number of translation records in correspondence with language partition 

columns. Each translation record contains a number of fields of completely constructed visual or 
audible representation of dialogue. The mode control unit selects a current processing mode of data 
processing system from several different user interface execution algorithms. The client media 
control unit maintains client partition index referencing one of the language partition columns. The... 
...there is a reasonably predictable dialogue. Manages a personalized set of data through editing any 
of translation rules. Communication is enabled through the most simple action, such as pressing of a 
button. Provides a learning tool for people who want to sharpen their language skills, or sharpen 

skills on one or more other languages 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138
Display buttons Class Codes International Patent Classification IPC Class Level Scope Position
Status Version Date G10L-017/28 Main Original Publication Data by Authority Claims: A data
processing system for enhancing human communications, comprising: a set of client controls for 
navigating the user interface of said data processing system; a plurality of display buttons, each 
button of said plurality of display buttons being an individual visible mechanical button 
distinguishable by physical touch without inadvertent invocation, wherein said button includes a 
dedicated display means integrated thereon for dynamically displaying a plurality of purposes for 



manually invoking said button, and wherein said button is capable of bearing a physical touch 

communication; tabular translation rules means least one host partition index referencing a 

language partition column of said language partition columns; means for communicating to a user 
through said plurality of display buttons said plurality of fields for a set of first translation records of 
a language partition column referenced by said client partition index, in accordance with said client 
controls and said current processing mode; means for associating a reference of each record of said 
set of first translation records to a button of said plurality of display buttons; means for determining 
with said reference a selected translation record of said set of first translation records upon a user 
invoked button of said plurality of display buttons; and means for communicating to a user through 
said user invoked button said plurality of fields of a second translation record of the translation 
member row containing said selected translation record, said second translation record of a language 
partition column referenced by said host. . . 
Dynamically generated speech grammar system for voice controlled web browser Alerting
Abstract ...NOVELTY - A speech specific parser performs interactive control from a script
describing a web page. A visual speech hint generator adds a visual speech hint to the web page
script. A speech grammar compiler generates a speech grammar for each hyper link and visual
control, which is translated into compiled speech grammar. DESCRIPTION - A grammar based
speech recognition engine translates the speech grammar into a compiled speech grammar and 
determines a speech event from a voice audio input device. A rendering engine executes the speech 
event on a visual web page rendered from the modified web page script. An INDEPENDENT 

CLAIM is ... DESCRIPTION OF DRAWINGS - The figure shows the flow diagram illustrating the
process of displaying a web document and handling a voice command. Class Codes International
Patent Classification IPC Class Level Scope Position Status Version Date G10L-0015/22
G10L-0015/26 G10L-0015/00... Original Publication Data by Authority... Original
Abstracts: ...implementing a voice-controlled Web browser program executing on a wearable
computer is disclosed. A Web document is received (602) at the wearable computer, and processed
to dynamically generate a speech grammar (604). The speech grammar is used to recognize voice
commands (616) at the wearable computer. Alternatively, a Web document is precompiled at a
server computer to generate a speech grammar, and the speech grammar is transmitted with its
corresponding Web document to the wearable computer. The wearable computer provides three
mechanisms for a user to navigate Web pages by the use of ... mechanism, the user can speak the
text of the hyperlink to activate the hyperlink (616). In a third mechanism, the user invokes a
command to display a dialog window having a list of hyperlinks and their corresponding index
values. The user can speak an index value or a hyperlink to activate the hyperlink (618... 
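
As a rough illustration of generating a grammar from a received Web document, the following Python sketch collects the hyperlinks of a page and maps both the link text and an index value to the link target, mirroring the two spoken forms mentioned in the abstract. It is only a sketch under stated assumptions: a plain dictionary stands in for a compiled speech grammar, the utterance is assumed to arrive already transcribed as text, and the names `LinkCollector`, `build_grammar` and `recognize` are hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects (link text, href) pairs from anchor elements."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            text = " ".join("".join(self._text).split())
            if text:
                self.links.append((text, self._href))
            self._href = None


def build_grammar(html):
    """Map each speakable phrase (link text or index value) to a link target."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    grammar = {}
    for index, (text, href) in enumerate(collector.links, start=1):
        grammar[text.lower()] = href   # speaking the text of the hyperlink
        grammar[str(index)] = href     # speaking its index value
    return grammar


def recognize(grammar, utterance):
    """Look up an already transcribed utterance; None if it is not in the grammar."""
    return grammar.get(" ".join(utterance.lower().split()))


if __name__ == "__main__":
    page = '<p><a href="/news">Latest News</a> and <a href="/mail">Mail</a></p>'
    print(recognize(build_grammar(page), "latest news"))
```

Restricting recognition to phrases derived from the current page is the design point being illustrated: the vocabulary stays small and changes with each document.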
Alerting Abstract ...NOVELTY - An electronic reading aid (29) is mounted on top of a video
display (31). The aid includes a camera (37) that has a field of view onto the printed material (33) to 
be read. The images obtained by the camera are transmitted to a computer (35) which extracts text 
and graphics information so that enhanced images can be displayed for the partially sighted. Text is 
also converted into synthesized speech (47) based on user commands that are generated by hand 
and finger gestures. Audio and tactile feedback is provided to the user. ...31 Video display Class 
Codes International Patent Classification IPC Class Level Scope Position Status Version Date 
...G10L-0013/04 ...G10L-0013/00 Original Publication Data by Authority Original Abstracts: ...text
to be read (37, 51), on which the user performs finger- and hand-based gestural commands, are input 
to a computer, which decodes the text images into their symbolic meanings through optical 
character recognition (55), and further tracks (57) the location and movement of the hand and 
fingers in order to interpret the gestural movements into their command meaning. In order to allow 
the user to select text and align printed material, feedback is provided to the user through audible 
and tactile means. Through a speech synthesizer, the text is spoken audibly. For users with 
residual vision, visual feedback (71) of magnified and image enhanced text is provided. Multiple
cameras of the same or different fields of view can improve performance. In addition, alternative
device configurations allow portable operation, including the use of cameras located ... Images of
the text to be read, on which the user performs finger- and hand-based gestural commands, are input to
a computer, which decodes the text images into their symbolic meanings through optical character 
recognition, and further tracks the location and movement of the hand and fingers in order to 



interpret the gestural movements into their command meaning. In order to allow the user to select 
text and align printed material, feedback is provided to the user through audible and tactile means. 
Through a speech synthesizer, the text is spoken audibly. For users with residual vision, visual 
feedback of magnified and image enhanced text is provided. Multiple cameras of the same or 
different field of view can improve performance. In addition, alternative device configurations allow 

portable operation, including the use of cameras located on ... text to be read (37, 51), on which the

user performs finger- and hand-based gestural commands, are input to a computer, which decodes the 
text images into their symbolic meanings through optical character recognition (55), and further 
tracks (57) the location and movement of the hand and fingers in order to interpret the gestural 
movements into their command meaning. In order to allow the user to select text and align printed 
material, feedback is provided to the user through audible and tactile means. Through a speech 
synthesizer, the text is spoken audibly. For users with residual vision, visual feedback (71) of 
magnified and image enhanced text is provided. Multiple cameras of the same or different fields of
view can improve performance. In addition, alternative device configurations allow portable 
operation, including the use of cameras located on worn platforms, such as eyeglasses (100), or on a 
fingertip system (123). The use of gestural commands is natural, allowing for rapid training and ease 
of use. The device also has application as an aid in learning to read, and for data input and image 
capture for home and business uses... Claims: A method for electronically reading text under
interactive control by a user, the method comprising: obtaining a first digital image of at least a
portion of the text to be read; performing symbology recognition on the first digital
image; capturing a temporal sequence of digital images of user-generated spatial configurations of
at least one pointer; determining a command signal from the temporal sequence of digital
images; choosing a subset of the recognized symbology to read at a spatial location on the text to be
read on the basis of the determined command signal; feeding back to the user the spatial location of
the chosen subset to be read and the spatial location of at least one pointer relative to the
recognized text, wherein the spatial location of the chosen subset and the pointer aids the user in
selecting the text to be read; and converting the chosen subset of recognized symbology into a
humanly understandable version. 
Voice recognition based document editing method in word processing, spreadsheet application
Original Titles: Method and apparatus for editing documents through voice recognition. Alerting
Abstract ...NOVELTY - Voice recognizing unit (46) of a data processing system with a CPU and a 
graphical display, receives words comprising a command portion and a target portion from user 
through microphone. The command portion and targets such as sentence, word or paragraph are
recognized by the CPU and a numerical emblem is sequentially displayed at the recognized target
positions. ...DESCRIPTION OF DRAWINGS - The figure shows portion of data processing system
including voice recognition components 46 Voice recognizing unit Class Codes International
Patent Classification IPC Class Level Scope Position Status Version Date ...G10L-0015/22...
...G10L-0015/26 G10L-0015/00 Original Publication Data by Authority Original Abstracts: The
present invention provides a method and apparatus that places visual markers onto the display of a 
document on a graphical user interface. The visual markers assist in the navigation around the 
document by numbered artifacts. Claims: Claim 6. A display system for displaying visible artifacts
in a document in a data processing system, the display system comprising:
a. a central processing unit;
b. a voice recognition unit to provide command input to the central processing unit;
c. a graphical display to provide visible output from the central processing unit;
d. the voice recognition unit providing means for receiving a voice order by the voice recognition
unit, the voice order comprising a command portion and a target portion; and
e. means within the central processing unit
i. for parsing the command portion of the voice order to determine locations within the
document of targets designated by the voice order;
ii. for ascertaining within the document the location of each target determined by the step of
parsing; and
iii. for placing an artifact on the display at the location of each target ascertained. 
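
The sequence in this record (a voice order with a command portion and a target portion, location of each target in the document, a numbered artifact at each location) can be sketched in Python as follows. This is a simplified illustration, not the claimed apparatus: the voice order is assumed to be already transcribed into two words, the regular expressions in `TARGET_PATTERNS` are a naive assumption about what counts as a word, sentence or paragraph, and all function names are hypothetical.

```python
import re

# Naive target definitions (assumption): each pattern starts at a non-space character.
TARGET_PATTERNS = {
    "word": r"\S+",
    "sentence": r"[^\s.!?][^.!?]*[.!?]?",
    "paragraph": r"\S[^\n]*",
}


def parse_voice_order(order):
    """Split a transcribed voice order into (command portion, target portion)."""
    words = order.lower().split()
    if len(words) != 2 or words[1] not in TARGET_PATTERNS:
        raise ValueError(f"unsupported voice order: {order!r}")
    return words[0], words[1]


def locate_targets(document, target):
    """Return the start offset of every target of the given kind."""
    return [m.start() for m in re.finditer(TARGET_PATTERNS[target], document)]


def place_markers(document, offsets):
    """Insert a numbered marker such as [1] before each located target."""
    parts, last = [], 0
    for number, offset in enumerate(offsets, start=1):
        parts.append(document[last:offset])
        parts.append(f"[{number}]")
        last = offset
    parts.append(document[last:])
    return "".join(parts)


if __name__ == "__main__":
    text = "First one. Second one."
    command, target = parse_voice_order("select sentence")
    print(place_markers(text, locate_targets(text, target)))
```

Once the markers are shown, a follow-up voice input could refer to a target by its number, which is the navigation aid the abstract describes.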
Voice controlled video display for interactive appts. enabling hands-free operation - has device
for recognising set of data links which connect documents for display on video screen, and
compiler receives text from interpreter and produces corresp. codes which are merged with
finite operation commands ...Original Titles: System and method for voice controlled video
screen display; System and method for voice controlled video screen display; SYSTEM FOR
MAKING VIDEO SCREEN DISPLAY VOICE INTERACTIVE TYPE; System and method
for voice controlled video screen display. Alerting Abstract ...The system for providing voice
interaction for a video screen display has a device for recognising a set of data links associated
with the display. A device recognises voiced utterances corresponding to the data links and
converts them to natural language text. An evaluator analyses the natural language text and
converts it to programming commands for changing the screen display in response to the
recognised data link. Title Terms .../Index Terms/Additional Words: DISPLAY; Class Codes
International Patent Classification IPC Class Level Scope Position Status Version Date ...G10L-015/22
G10L-007/08 G10L-007/10 Main ...G10L-015/00 G10L-003/00 Original
Publication Data by Authority Original Abstracts: A system and method for a voice controlled video
screen display system are provided. The voice controlled system is useful for providing "hands-free" 
navigation through various video screen displays such as the World Wide Web network and
interactive television displays. During operation of the system, language models are derived from
incoming data in applications such as the World Wide Web network ... A system and method for a
voice controlled video screen display system are provided. The voice controlled system is useful for
providing "hands-free" navigation through various video screen displays such as the World Wide 
Web network and interactive television displays. During operation of the system, language models 
are provided from incoming data in applications such as the World Wide Web network. Claims: 1. A
system for making a video screen display voice interactive comprising: 
means for recognizing a set of data links associated with a video screen display; 
means for recognizing voiced utterances corresponding to said data links and converting said 
voiced utterances into natural language text; and 

means for evaluating natural language text received from said means for recognizing voiced
utterances and converting said natural language text to programming commands for changing the
video screen display in response to said recognized data link ... A voice-interactive video screen
display system comprising: means for identifying and interpreting a set of data links (104), each data
link associated with visual information concurrently displayed on a video screen; means for
recognizing voiced utterances (106) corresponding to the displayed visual information on said
video screen and the associated data link, and generating corresponding natural language text, said
visual information having been displayed prior to the voiced utterances being spoken; and means
for evaluating said natural language text and converting said natural language text to programming
commands (112) for changing the video screen display in accordance with said data link
corresponding to said recognized voiced utterances. Voice-interactive video screen display
system comprising: means for identifying and interpreting a set of data links (104),
each data link being associated with visual information displayed simultaneously on a
video screen; means for recognizing voiced utterances (106) corresponding to the visual
information displayed on said video screen and to the associated data link, and generating a
natural language text ... A voice-interactive video screen display system comprising: means for

recognizing a set of data links, each data link associated with visual information concurrently
displayed on a video screen; means for recognizing voiced utterances corresponding to the displayed
visual information on said video screen and the associated data link, and generating corresponding
natural language text, said visual information having been displayed prior to the voiced utterances
being spoken; and means for evaluating said natural language text and converting said natural
language text to programming commands for changing the video screen display in accordance with
said data link corresponding to said recognized voiced utterances. 
Pattern matching method with apparatus for speech recognition in telephony system - uses 
dynamic programming, defines active patterns and stores reference values, then updates 
cumulative values and propagates programming path using pointer which identifies earliest 
reference pattern on new active list 
Guidance device with voice command for word processors e.g. computers has sound
information issued from audio output such as speakers which issues voice commands generated
through phonetic generator Alerting Abstract ...The device is assembled with an input (101) to 
control the cursor and pointer appearing on a display screen. A position detector (102,103) is
furnished to detect the position of the cursor or pointer on the screen to correspond to a position
such as a tone and a voice. The correspondence is made through a phonetic control information
memory (107). The tone, voice, etc. is formed by an installed phonetic generator (105). The sound
information is issued through an audio output (111) such as a speaker. USE/ADVANTAGE -
Facilitates guidance of word processor with voice command through addition of SUI environment
(Sound User Interface) to GUI environment providing dynamic operation environment. Class Codes 
International Patent Classification IPC Class Level Scope Position Status Version Date ...G10L- 
PROBLEM TO BE SOLVED: To provide an information processor, a printer, a printing system and 
a printing method for converting information regarding speech into character information in the 
case that the information regarding the speech is included in hyperlink information within a hypertext 
inputted through a network. 

SOLUTION: At the time of displaying the hypertext inputted through the network 208 and a network 
interface part 207 by a browser 210 at a display part 202 and printing the hypertext in this printer 
220, in the case that the information regarding the speech is included in the hyperlink information 
within the hypertext, a speech recognition part 205 converts the information regarding the speech 
into the character information and a printing command generation part 206 generates a printing 
command and transmits the character information to the printer 220. 